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DISCRIMINANT  ANALYSIS  OF  RESPIRATORY  SOUNDS 
OF  PULMONARY  INSUFFICIENT  PATIENTS  AND  NORMAL  SUBJECTS 


INTRODUCTION 

Respiration  is  one  of  the  physiological  functions  of  concern  when 
examining  or  treating  a  patient.  A  clinical  relationship  between 
respiratory  sounds  and  gross  respiratory  pathology  was  established  in  the 
nineteenth  century.  Auscultation  of  respiratory  sounds,  however,  is 
subjective.  Due  to  the  subjectiveness,  there  are  varying  degrees  of 
acceptance  of  respiratory  sounds  as  a  clinical  sign.  To  alleviate  the 
problem,  various  researchers  have  studied  respiratory  sounds  to  explore 
and  develop  automated  methods  for  analysis  and  diagnosis  of  pulmonary 
diseases . 

The  objective  of  this  study  is  to  determine  whether  respiratory  sound 
data  of  normal  volunteers  and  pulmonary  insufficiency  subjects  can  be 
classified  as  normal  or  abnormal  pulmonary  systems  by  discriminant 
analysis  and  if  the  spectral  parameters  for  discriminating  respiratory 
sounds  correlate  with  clinical  parameters  obtained  from  the  pulmonary 
function  test.  Patient  data  was  obtained  from  the  U.S.  Air  Force  School 
of  Aerospace  Medicine,  Brooks  Air  Force  Base,  Texas.  The  classification 
(normal/pulmonary  insufficiency)  of  the  subjects  was  determined  from  a 
single  pulmonary  function  test. 


Background 

The  process  of  respiration  is  of  vital  importance  to  life  and 

includes  the  following  mechanistic  events:  (1)  pulmonary  ventilation,  the 
inflow  and  outflow  of  air  between  the  atmosphere  and  the  lung  alveoli,  (2) 
diffusion  of  oxygen  and  carbon  dioxide  between  the  alveoli  and  the  blood, 
and  (3)  transportation  of  oxygen  in  the  blood,  principally  in  combination 
with  hemoglobin,  to  the  tissue  capillaries  where  it  is  released  for  use  by 
the  cells  according  to  their  metabolic  needs  [17]. 

The  foundations  of  respiratory  medicine  were  laid  at  the  beginning 
of  the  nineteenth  century  when  Laennec  [20]  established  the  clinical 

relationship  between  respiratory  sound  and  gross  pulmonary  pathology  by 
the  use  of  the  early  stethoscope.  Auscultation  in  respiratory  medicine, 
however,  has  advanced  slowly  since  Laennec  established  auscultation  of 
lung  sounds  as  a  means  of  diagnosing  the  condition  of  the  lungs. 

The  slow  progress  is  due  to:  (1)  a  variety  of  human  factor  problems, 
(2)  the  limitations  of  the  instrumentation,  (3)  the  lack  of  total 
understanding  of  the  mechanism  of  production  of  respiratory  sounds,  and 
1 4  >  the  lack  of  understanding  of  the  origin  of  the  source  of  the  sounds. 

Human  Factor  Problems 

Respiratory  sounds  heard  through  a  stethoscope  can  be  roughly 

classified  into  two  main  types:  First  are  normal  respiratory  sounds, 

1 


these  are  both  inspiratory  and  expiratory  sounds  heard  as  the  air  moves  in 
and  out  of  the  chest  during  normal  breathing.  There  are  two  types  of 
normal  respiratory  sounds.  Tracheal  or  bronchial  respiratory  sounds  are 
heard  by  placing  the  stethoscope  over  the  trachea  and  listening  as  the 
patient  breathes  in  and  out  with  the  mouth  open.  The  sound  is  described 
as  "tubular"  and  is  similar  to  the  sound  that  arises  when  air  is  blown 
through  a  tube.  The  other  major  type  of  respiratory  sound  is  called 
vesicular.  The  term  "vesicular"  in  Latin  refers  to  little  vessels.  This 
description  refers  to  the  sound  that  is  heard  over  the  majority  of  the 
chest  of  normal  persons  during  normal  breathing.  The  analogy  used  is  the 
sound  heard  by  the  rustle  of  wind  in  the  trees  [9]. 

The  term  "adventitious"  is  used  to  describe  sounds  not  expected  in 
the  normal  chest.  To  complicate  matters,  the  terminology  of  adventitious 
sounds  is  not  standard.  The  clinician  attempts  to  describe  the  quality  of 
sounds  by  adjectives  that  convey  an  idea  of  relative  intensity  and  pitc’- 
Different  clinicians  use  the  same  term  to  describe  dissimilar  soun^ 
[1,9]. 


Adventitious  sounds  are  divided  into  those  believed  to  have  a 
bronchopulmonary  origin  and  those  thought  to  be  due  to  pleural  disease. 
Those  of  bronchopulmonary  origin  are  further  subdivided  into  "continuous" 
and  "discontinuous".  Sounds  that  last  for  more  than  a  tiny  fraction  of 
the  respiratory  cycle  are  referred  to  as  continuous.  Rales,  the 
discontinuous  sounds,  are  further  subdivided  into  fine,  medium,  and  coarse. 
A  variety  of  adjectives  appear  in  the  medical  literature  to  classify  these 
sounds  further.  Examples  of  these  adjectives  include  dry,  wet,  moist, 
bubbling,  crepitant,  subcrepitant,  and  consonating.  The  terminology  is 
subjective  in  its  interpretation  depending  greatly  on  the  hearing  and 
experience  of  the  clinician  [1,2,12,13,25-27]. 

Another  problem  in  chest  auscultation  is  that  so  much  information 
exists  that  it  is  difficult  either  to  record  it  properly  or  to  remember 
the  observed  details.  On  listening  over  a  single  site  in  the  chest,  it  is 
possible  to  observe  the  intensity  of  both  inspiration  and  expiration.  It 
is  also  possible  to  grade  these  on  a  scale  that  may  reflect  normality  or 
abnormality  of  the  site.  The  clinician  may  also  be  able  to  record  the 
presence  or  absence  of  various  adventitious  sounds  and  their  relationship 
to  the  respiratory  cycle.  The  duration  of  inspiration  with  respect  to 
expiration  may  also  be  noted.  If  the  clinician  listens  to  one  or  more 
sites,  as  is  common  in  routine  chest  auscultation,  then  there  is  a 
possibility  thr.t  information  may  be  lost  due  to  various  factors  such  as 
interruptions  in  the  physical  examination,  lack  of  accurate  record 
keeping,  or  the  clarity  of  the  clinician's  initial  observations  [25].  The 
diversity  of  the  terms  used  to  describe  respiratory  sounds,  together  with 
the  nonuniformity  of  their  usage  by  clinicians,  poses  difficulties  m  the 
use  of  respiratory  sounds  as  a  precise  indicator  of  the  condition  of  the 
respiratory  system. 


Limitations  of  Instrumentation 


The  binaural  stethoscope  appeared  towards  the  middle  of  the 
nineteenth  century  and  became  popular  mainly  because  it  excludes 
extraneous  noise  [15,24].  The  choice  of  the  best  chest  piece  remained 
controversial  until  physicians  agreed  that  both  the  diaphragm  and  the  bell 
were  necessary  for  auscultation  of  the  heart.  The  diaphragm  and  bell  are 
combined  in  most  stethoscopes  in  current  use. 

The  stethoscope  transmits  the  range  of  frequencies  which  includes 
frequencies  of  heart  and  lung  sounds.  By  varying  the  pressure  between  the 
chest  piece  and  the  skin,  the  intensities  of  certain  frequencies  are 
increased  while  others  are  decreased.  Low-pitched  heart  sounds  are  heard 
best  with  the  bell  resting  lightly  on  the  skin,  while  firm  pressure  of  the 
bell  or  diaphragm  increases  the  intensity  of  higher  frequencies  and 
suppresses  unwanted  low-pitched  sounds.  Respiratory  sounds  contain  a  wide 
range  of  frequencies.  To  compare  relative  frequency  intensities  within  a 
particular  sound  spectrum,  the  measuring  instrument  should  not  contribute 
variations  in  intensity.  The  conventional  stethoscope  exhibits  this 
limitation  [11,25]. 

Sound  evaluation  is  further  complicated  by  the  nonlinearity  of  the 
human  auditory  system.  The  ear  is  capable  of  distinguishing  small 
differences  in  pitch.  A.;  the  intensity  of  the  sound  increases,  the 
sensitivity  of  the  ear  to  intensity  variations  decreases  logarithmically. 
The  ears'  perception  of  intensity  falls  off  at  both  ends  of  the  frequency 
spectrum  [11].  The  frequencies  of  sound  that  a  young  person  can  hear 
differ  from  the  frequencies  of  sound  that  an  older  person  can  hear.  The 
range  falls  between  30  and  20,000  cycles  per  second (cps)  for  a  young 
person  and  50  to  8,000  cps  in  old  age  [16].  The  ear  is  unable  to 
distinguish  short  sound  bursts.  A  burst  shorter  than  3  ms  will  be  heard 
only  as  a  click  regardless  of  the  frequency  [13]. 

Electronic  instruments  can  be  designed  to  exhibit  a  flat  frequency 
response  over  the  range  of  respiratory  sound  spectrum  and  thus  overcome 
some  of  the  instrumentation  shortcomings.  The  major  obstacles,  however, 
in  the  acceptance  and  advancement  of  using  respiratory  sounds  as  a  major 
clii.-wal  tool  in  pulmonary  medicine  are  the  lack  of  complete  understanding 
of  the  mechanism  of  production  of  respiratory  sounds,  and  the  sources  from 
which  respiratory  sounds  are  generated. 

Since  Laennec's  time  it  has  been  known  that  pulmonary  pathology  can 
cause  a  change  in  respiratory  sound.  Differences  exist  between  normal  and 
pathological  respiratory  sounds.  In  1976,  Grassi  et  al.  [15]  used  the 
technique  of  phonopneuroography  to  analyze  respiratory  sounds . 
Phonopneumography  is  defined  as  the  technique  of  detecting  and  analyzing 
the  sounds  that  are  produced  in  the  bronchopulmonary  area  during 
respiration.  The  apparatus  used  was  intended  to  provide  a  graphic 
recording  of  the  level  of  the  sounds  the  clinician  perceives  through  the 
stethoscope  during  auscultation.  The  phonopneumographic  records  revealed 
that  inspiration  was  louder  than  expiration  in  the  healthy  subjects. 
Wherever  the  ratio  between  inspiratory  and  expiratory  peak  amplitudes  was 


found  to  be  highly  modified,  either  because  inspiration  was  much  louder 
than  normal,  or  because  an  inversion  of  the  ratio  occurred  with  an 
expiratory  sound  louder  than  inspiratory,  the  finding  was  always 
accompanied  by  pathological  alteration  of  the  pulmonary  zone. 

Chowdhury  and  Maj under  [4]  conducted  an  experiment  using  digital 
spectral  analysis  of  respiratory  sounds  for  the  purpose  of  determining  the 
clinical  relationship  between  the  frequency  spectrum  and  the  conditions  of 
the  lungs  in  pulmonary  diagnosis.  Their  study  included  6  normal  subjects 
and  6  tuberculosis  patients  with  fibrosis.  The  recordings  of  respiratory 
sounds  were  made  in  a  quiet  room  with  the  subject  in  the  supine  position 
and  the  microphone  placed  on  the  right  lung  base.  The  Fast  Fourier 
Transform(FFT)  algorithm  of  Cooley  and  Tukey  was  used  to  obtain  the 

normalized  autospectrum  of  0.25s  time  segments  of  respiratory  sound.  Their 
analysis  indicates  a  maximum  amplitude  of  about  250  Hz  for  subjects 
without  pathological  lung  history,  with  rapid  decrease  in  amplitude  as  the 
frequency  increases  and  approaches  1000  Hz.  In  the  case  of  the  tubercular 
lung,  a  downward  frequency  shift  of  amplitude  peak  and  the  presence  of 
higher  frequency  components  were  observed. 

Charbonneau  et  al.  [3]  developed  an  index  which  they  used  to 
discriminate  between  asthmatics  and  normal  subjects.  They  calculated  the 
average  spectrum  for  inspiration  and  expiration  and  referred  to  it  as  a 
histogram.  Four  parameters  for  both  expiratory  and  inspiratory  histograms 
were  calculated  and  the  sum  of  the  parameters  was  used  as  an  index  to 

discriminate  between  asthmatics  and  normal  subjects.  The  parameters  were: 
the  bandwidth  (taken  at  half  of  the  peak  amplitude),  the  integral  over  the 
range  60-1260  Hz,  the  highest  significant  frequency  (taken  to  be  10%  of 
the  amplitude  of  the  peak  frequency),  and  the  weighted  mean  frequency  of 
each  mean  spectra.  Their  study  included  11  normal  and  10  asthmatic 
subjects . 

Problems  with  Sound  Intensity  Recorded  from  Chest  Wall 

The  correlation  of  respiratory  sound  intensity  and  the  distribution 

of  pulmonary  ventilation  was  first  studied  by  Leblanc  et  al.  [21].  They 

concluded  that  the  intensity  of  respiratory  sounds  varied  with  lung 
volume,  flow  rate,  body  orientation,  and  the  site  of  the  recording. 
O’Donnell  and  Kraman  [28]  and  Dosani  and  Kraman  [7]  conducted  studies  to 
investigate  the  intensity  patterns  of  lung  sound  on  the  chest  wall.  They 
concluded  that  there  was  a  considerable  intersubject  and  intrasub]ect 
variability  in  amplitude  of  the  inspiratory  vesicular  sound  heard  on  the 
chest  wall,  and  that  the  variability  was  due  to  factors  other  than  the 
distribution  of  ventilation  and  chest  wall  thickness.  These  variations 
happen  even  with  normal  subjects  without  any  diseases  of  the  lung.  They 
believed  that  the  other  factors  included  the  site  of  production  of  these 
sounds  and  their  transmission  through  the  airways  and  lung  tissue.  Dosani 
and  Kraman  (7)  pointed  out  that  the  chest  wall  thickness  may  not  have  a 
predominant  effect  on  the  intensity.  Their  results  showed  that  sound 
intensity  at  the  lateral  wall  was  similar  to  sound  at  positions  near  the 
spine  where  the  thickness  of  the  chest  wall  is  greater. 


Rationale  for  Trachea  as  Site  of  Respiratory  Sound  Detection 

The  variability  of  acoustic  properties  of  the  chest  wall  account  for 
the  variability  of  sound  intensity  as  measured  at  the  chest  wall 
[3,8,16,29,30] .  Previous  research  indicates  that  respiratory  sounds 
measured  at  the  trachea  undergo  minimum  filtering  [3,8,16].  Charbonneau 
[3]  stated  that  the  sound  level  is  higher  at  the  trachea  than  at  any  other 
point  of  the  chest  or  back  and  localizing  the  point  is  more  precise. 
Therefore,  recordings  of  the  respiratory  sound  for  this  study  were 
obtained  from  the  area  of  the  trachea. 


EXPERIMENTAL  PROCEDURES 

Data  were  obtained  from  the  USAF  School  of  Aerospace  Medicine  and 
were  collected  by  USAF  personnel  at  Wilford  Hall  USAF  Medical  Center  on 
patients  with  pulmonary  insufficiency  and  on  normal  volunteers.  Patients, 
were  classified  from  results  of  their  pulmonary  function  test  (PFT).  The 
experimental  procedure  used  by  the  U.S.  Air  Force  for  data  collection  was 
as  follows. 

The  patient  was  instrumented  with  a  pulmonary  flowmeter  that  was 
comprised  of  two  Fleisch  pneumotachometers  with  a  Rudolph  valve  between 

them  and  connected  to  a  mouthpiece.  One  pneumotachometer  was  used  to 

transduce  the  inspiratory  flow  rate  and  the  other  one  was  used  to 

transduce  the  expiratory  flow  rate.  The  pneumotachometer  devices  had 

pressure  taps  that  were  connected  to  Validyne  pressure  transducers.  An 
electronic  stethoscope  was  held  at  the  anterior  cervical  triangle  for  the 
detection  of  respiratory  sounds .  The  patient  breathed  through  the 
flowmeter  exclusively,  using  a  nose  clip  to  prevent  nose  breathing.  A 
minimum  of  5  min  of  recording  time  for  each  patient  was  collected  to  allow 
the  patient  to  become  accustomed  to  the  apparatus  and  to  establish  a 
normal  breathing  pattern.  The  respiratory  sounds  and  the  flow  rate  were 
transduced  and  recorded  on  an  FM  analog  tape  recorder. 

Experimental  Procedure  for  Data  Analysis 

The  magnetic  tapes  were  under  contract  to  Texas  A&M  University, 
Bioengineering  Department  for  data  analysis.  Since  the  classification 
(normal /pulmonary  insufficiency)  of  the  subject,  number  of  subjects,  and 
location  of  each  subject  in  relation  to  the  time  code  was  not  well 
controlled  by  the  U.S.  Air  Force,  the  study  results  suffered.  Of  the  35 
subjects,  only  24  could  be  identified.  Not  all  subject  records  contained 
their  PFT  or  their  classification. 

The  equipment  used  in  the  analysis  of  the  magnetic  tapes  consisted  of 
the  following  components: 

•  Ampex  2200  FM  analog  tape  recorder 

•  Datum  Time  Code  Generator /Reader  Model  9300 

•  20-Hz,  1800-Hz  low-pass  filters 

•  100-Hz  high-pass  filter 


•  A/D  Multiprogrammer  (HP  6942A) 

•  Tektronix  5A22N  Differential  Amplifier 

•  Tektronix  5111A  Storage  Oscilloscope 

•  Tektronix  2236  (100  MHz)  Storage  Oscilloscope 

The  arrangement  of  the  equipment  used  for  analysis  of  inspiratory  and 
expiratory  data  is  shown  in  Figure  1. 

Inspiratory  Data  Analysis 

As  shown  in  Figure  1,  the  respiratory  sound  signal  and  the 
inspiratory  flow-rate  signal  were  monitored  simultaneously  on  an 
oscilloscope.  The  magnetic  tape  speed  was  3-3/4  in./s.  Recordings  were 
made  with  standard  Inertial  Rate  Integrating  Gyro  (IRIG)  intermediate  band 
record/reproduce  amplifiers  with  a  cutoff  frequency  (3  dB)  at  19  kHz  and  a 
signal-to-noise  ratio  of  35  dB  (root-mean-square(rms)  signal-rms  noise). 
Channel  10  of  the  FM  recorder  was  connected  to  the  time  decoder  to  display 
the  recorded  time.  The  time  decoder  was  monitored  until  a  valid  time  code 
was  displayed,  indicating  a  subject's  data  were  recorded  on  Channel  2 
(inspiratory  flow  data)  and  on  Channel  4  (respiratory  sound  data)  of  the 
magnetic  tape.  Then,  the  time  code  and  signals  were  simultaneously 
monitored  and  the  total  length  of  the  subject’s  inspiratory  data  was 
recorded.  The  total  length  of  time  was  assumed  to  be  the  time  between  the 
beginning  of  a  time  code  and  the  time  when  the  time  code  cleared.  Then, 
the  time  of  each  inspiratory  breath  was  also  recorded.  A  calibration 
procedure  was  performed  on  the  inspiratory  flow-rate  signal  to  calculate 
the  gain  that  was  used.  Transducer  transfer  gain  (volts/liters/second)  is 
multiplied  times  recording  gain  (XI)  and  playback  equipment  gain  (Gp>  to 
yield  "overall  gain"  (Gt).  Voltage  levels  divide  by  Gt  results  in 
physical  units  of  flow  in  liters/ second.  This  value  was  used  in  making 
the  graphs  of  flow  vs.  time.  Flow  in  voltage  levels  (or  analog  to  digital 
(A/D)  counts)  were  used  to  trigger  collection  of  respiratory  sounds.  For 
a  specific  inspiratory  breath  the  signal  on  the  magnetic  tape  was  viewed 
on  an  oscilloscope  and  the  voltage  of  the  signal  was  recorded.  The  signal 
from  the  output  of  the  20-Hz  filter  for  the  same  inspiratory  breath  was 
viewed  on  another  oscilloscope  simultaneously  and  the  voltage  of  the 
signal  was  recorded.  The  gain  was  calculated  by  the  ratio  of  the  voltage 
recorded  from  the  output  of  the  20-Hz  filter  to  the  voltage  recorded 
directly  off  the  magnetic  tape.  The  value  of  the  gain  was  necessary  in 
determining  the  correct  inspiratory  flow  rate.  This  procedure  was  done 
prior  to  inspiratory  data  collection  for  each  new  subject. 

The  inspiratory  flow-rate  signal  was  taken  from  Channel  2  of  the  FM 
recorder  and  fed  into  the  20-Hz  low-pass  filter.  The  output  of  the  filter 
was  fed  into  both  the  oscilloscope  for  monitoring  and  the  analog  input  box 
for  manual  triggering.  The  output  of  the  analog  input  box  was  then  fed 
into  Channel  2  of  the  A/D  converter. 

The  inspiratory  sound  signal  was  taken  from  Channel  4  of  the  FM 
recorder  and  fed  into  a  100-Hz  high-pass  filter  followed  by  a  1800-Hz  low- 
pass  filter.  The  output  of  the  filter  was  fed  into  both  the  oscilloscope 
for  monitoring  and  into  Channel  1  of  the  A/D  converter.  After  calibration 


of  the  flow  data  and  random  selection  of  four  breaths,  data  collection 
began. 

The  HP-9836  desktop  microcomputer  was  programmed  to  monitor  the 
inspiratory  flow  rate  via  the  A/D  channel  and  begin  data  collection  when 
the  flow  rate  reached  a  predetermined  level.  The  level  was  determined  by 
viewing  the  signal  on  the  oscilloscope.  Data  collection  was  handled  by  the 
multiplexed  A/D  channel  at  the  sampling  rate  of  8192  samples  per  second. 
By  choosing  a  sampling  rate  of  8192  samples  per  second,  the  result  was  to 
sample  each  analog  channel  (respiratory  sound  and  flow  rate)  at  4096 

samples  per  second.  The  program  was  modified  to  be  interactive;  therefore, 
it  allowed  for  versatility  in  the  processing  of  recorded  data  with  large 
variabilities.  The  program  prompted  the  operator  for  changes  of  the  flow 
rate  trigger  level,  the  duration  of  data  collection,  the  number  of 
repetitions,  the  data  file  name,  the  data  file  structure,  and  the  sampling 
rate.  The  program  also  had  additional  flexibility  that  allowed  the 

operator  to  decide  whether  or  not  to  save  the  data  that  was  collected. 

Much  of  the  data  contained  extraneous  noises  (i.e.,  talking,  babies  crying, 
door  slamming,  etc.). 

The  voltage  level  at  which  the  A/D  converter  started  collecting  data 
was  displayed  on  the  screen.  If  the  level  was  acceptable  with  the  trigger 
level  that  was  set  (within  0.2  V),  then  the  data  were  saved  and  stored  on 
the  diskette.  The  data  file  names  used  were  the  respective  time  codes  of 
the  breaths  collected.  This  procedure  continued  until  4  inspiratory 

breaths  of  each  subject  had  been  collected,  digitized,  and  stored  on 
diskettes  for  further  signal  processing. 


Expiratory  Data  Analysis 

As  shown  in  Figure  1,  both  the  expiratory  flow-rate  signal  and  the 
expiratory  sound  signal  were  monitored  simultaneously  on  an  oscilloscope. 
The  same  procedure  for  locating  the  inspiratory  data  was  used  to  locate 
expiratory  data. 

A  calibration  procedure  was  then  performed  on  the  expiratory  flow-rate 
signal  to  calculate  the  gain  that  was  used.  For  a  specific  expiratory 
breath,  the  signal  off  the  magnetic  tape  was  viewed  on  am  oscilloscope  and 
the  voltage  of  the  signal  was  recorded.  The  signal  from  the  output  of  the 
20-Hz  filter  for  the  same  expiratory  breath  was  viewed  on  an  oscilloscope 
and  the  voltage  of  the  signal  was  recorded.  The  gain  was  calculated  by  the 
ratio  of  the  voltage  recorded  from  the  output  of  the  20-Hz  filter  to  the 
voltage  recorded  directly  off  the  magnetic  tape.  The  value  of  the  gain  was 
necessary  in  determining  the  correct  expiratory  flow  rate.  This  procedure 
was  done  prior  to  expiratory  data  collection  for  each  new  subject. 

The  expiratory  flow-rate  signal  was  taken  from  Channel  3  of  the  FM 
recorder  and  fed  into  the  differential  amplifier  of  the  oscilloscope.  The 
differential  amplifier  was  used  to  balance  the  direct  current  (DC)  offset 
by  adjusting  the  DC  step  attenuation  balance  and  the  position  knob.  The 
filter  setting  was  DC.  This  step  was  needed  because  the  expiratory  flow 
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transducer  that  was  used  had  a  1.4  v  DC  off  sat.  The  output  of  the 
oscilloscope  was  fad  into  the  input  of  the  20-Hz  filter,  and  the  output 
of  the  20-Hz  filter  was  fad  into  both  the  oscilloscope  for  monitoring  and 
the  analog  input  box  for  manual  triggering.  The  output  of  the  analog 
input  box  was  then  fad  into  Channel  2  of  the  A/D  converter.  The 
respiratory  sound  signal  was  taken  from  Channel  4  of  the  7M  recorder  and 
fad  into  a  100-Hz  high-pass  .'-Iter  followed  by  a  1800 -Hz  low-pass  filter. 
The  output  of  the  filter  was  fed  into  both  the  oscilloscope  for  monitoring 
and  into  Channel  1  of  the  A/D  converter. 

The  same  procedure  of  expiratory  data  collection  using  the  HP-9836 
microcomputer  and  A/D  converter  as  noted  for  the  inspiratory  data 
collection  was  used.  This  procedure  continued  until  4  expiratory  breaths 
of  each  subject  had  been  collected,  digitized,  and  stored  on  diskettes  for 
further  signal  processing. 


QUANTITATIVE  ANALYSIS 

The  Fourier  series  is  used  to  represent  arbitrary  periodic  functions 
by  an  infinite  series  of  sinusoids  of  harmonically  related  frequencies  to 
study  the  time  domain  responses  in  networks.  The  Fourier  series  expressed 
as  a  linear  combination  of  harmonically  related  complex  exponentials  can 
be  written  in  the  form: 


x(t)  «  /  ane*5^©1  dt  (1) 

ak  •  1/T0  /  x(t)e'llu'ot  dt  (2) 


where  k  *  1,  2,...,  ■. 

Equation  (1)  is  often  referred  to  as  the  synthesis  equation  and 
equation  (2)  as  the  analysis  equation.  The  coefficients,  ak,  are  often 
called  the  Fourier  series  coefficients  or  the  spectral  coefficients  of 
x(t).  These  complex  coefficients  measure  the  portion  of  the  signal  x<t) 
that  is  at  each  harmonic  of  the  fundamental  component.  The  fundamental 
frequency  is  defined  as  u0,  and  the  fundamental  period  is  T0»2 x/w0. 

The  Fourier  series  method,  however,  has  limitations  in  analyzing 
linear  systems  for  the  following  reasons:  1)  The  Fourier  series  can  be 
used  for  inputs  which  are  periodic;  however,  most  inputs  in  practice  are 
nonperiodic.  2  The .  method  applies  only  to  systems  that  are  stable.  A 
stable  system  is  a  system  whose  natural  response  decays  m  time. 

The  first  limitation  can  be  overcome  since  we  can  represent  the 
nonperiodic  input  m  terms  of  exponential  components.  A  method  of 

accomplishing  this  function  is  the  Fourier  transform.  For  instance, 
consider  the  nonperiodic  function  f't;  which  we  would  like  to  represent  in 
terms  of  exponential  components.  To  do  this,  we  constructed  a  periodic 
function  f,(t)  with  a  period  T,  where  the  function  f(t>  is  repeated  every 
T  seconds.  The  period  T,  is  considered  large  enough  so  there  is  no 
overlapping  between  pulse  shapes  of  ftt).  The  new  function  is  a  periodic 
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function  and  can  be  raprasantad  with  an  axponantial  Fourier  series  as 
follows : 


fT(t)  -  ZFne3n"o’ 

(3) 

where  u0»  2jt/T, 

(4) 

and  Fn  «  1/T  Z  fT(t)e'jnwot 

(5) 

The  next  process  is  to  evaluate  the  function  as  the  period  increases 
to  infinity.  As  T  becomes  infinite,  the  pulses  repeat  after  an 
infinite  interval.  Therefore,  fT(t)  and  f(t)  are  identical  in  the  limit, 
and  the  Fourier  series  representing  the  periodic  function  f.(t)  will  also 
represent  f(t). 

In  the  limit,  as  T  approaches  infinity,  u  approaches  zero. 
Therefore,  u0  can  be  denoted  as  Su.  Then:  T  *  2it/<j0*2v/8<j0  and, 

TFn-  IfT(t)e*^n6"t  (6) 

TFn  is  a  function  of  jn6«,  so  let  TF*F(jn6u).  Equation  (3)  becomes: 

fT(t)  «  Z  Fpe^o1  (7) 

•  Z[F(  jnSwi/Tje' 3n4‘“,t  (8) 

*  I{[F(jn6u)/2jr]Sw}  e(;)n6“)t  (9) 

In  the  limit,  as  T  approaches  infinity,  6«  approaches  zero,  and  fT(t) 
approaches  f ( t ) .  Then : 

f(t)  ■  Lim  f.(t)«- Lim  1/2*  F(jn6«j)  e5n6tot&^]  (10) 


is  by  definition: 


f(t)  ■  1/2*  I  F(jw)ejwt  d u 

(11) 

Equation  ( 11 ) 

is  known  as  the  inverse  Fourier  transform. 

Recall 

equation  ( 5) : 

F.  -  1/T  I  C~(  t)e~  ^ni‘o*  dt 

-  12 ) 

*  F ( jn6w)/T 

<  13 ' 

then 

F(  ju)  »  Lim  /  f,(t >e'5n6wtdt 

<  14  i 

F ( ju)  =  /  f(t;e'lwt  dt 

■  15  * 

Equation  '15;  is  known  as  the  direct  Fourier  transform.  The  result  is 
that 

F(ju)  -  J  f(t)e'3wt  dt  (16) 


is  the  representation  of  the  nonperiodic  function  f(t)  in  terms  of 
exponential  functions.  The  amplitude  of  the  component  of  any  frequency  u 
is  proportional  to  F(ju).  In  analogy  with  the  terminology  used  for  the 
Fourier  series  coefficient  of  a  periodic  signal,  the  transform  F(j<j)  of  an 
aperiodic  signal  f(t)  is  commonly  referred  to  as  the  spectrum  of  f(t),  as 
it  provides  the  information  concerning  how  f(t)  is  composed  of  sinusoidal 
signals  at  different  frequencies. 

The  second  limitation  applies  to  nonstationary  data  from  a  system. 
During  normal  respiration  the  respiratory  system  is  stable.  If  the  signal 
from  a  system  is  stationary,  the  system  may  be  assumed  to  be  stable.  Weak 
stationarity  is  shown  in  the  ergodic  sense  by  averaging  several  records  of 
an  individual  at  one  time  and  averaging  another  set  from  the  same  subject 
at  a  different  time,  if  the  means  and  autocorrelation  are  invariant  the 
data  is  weakly  stationary.  This  procedure  was  shown  in  the  statistics  in 
reference  34.  Hence,  for  this  study  the  data  is  assumed  to  be  stationary, 
which  is  the  case  for  mean  data,  but  not  necessarily  for  individual  sample 
data. 

In  the  study,  the  respiratory  signal  was  conditioned  by  the  use  of 
the  analog  filters  previously  discussed  in  the  experimental  procedures. 
The  100-Hz  high-pass  filter  and  the  1800-Hz  low-pass  filter  were  thus 
acting  together  as  a  band-pass  filter.  The  respiratory  sound  data  and 
flow  data  were  digitized,  separated  using  the  separation  program,  and 
stored  on  diskettes  for  the  Fast  Pourier  Transform  (FFT)  analysis.  The 
respiratory  sound  signal  was  digitized  at  a  rate  of  4096  samples/ second. 
The  sampling  rate  was  chosen  to  avoid  "aliasing"  of  the  spectra.  The 
literature  indicates  that  the  highest  frequency  of  normal  respiratory 
sound  is  about  1500  Hz,  and  this  may  be  higher  for  abnormal  subjects.  The 
Hyquist  sampling  criteria  requires  that  the  Nyquist  frequency  be  at  least 
twice  the  highest  frequency  of  interest  of  the  signal  being  sampled.  The 
first  1024  samples,  which  correspond  to  0.25  s  of  real-time  data,  were 
transformed  by  the  Cooley-Tukey  FFT  algorithm. 

When  only  a  finite  segment  of  the  signal  is  observed,  the  process  is 
equivalent  to  multiplying  the  signal  by  a  rectangular  window  function.  In 
the  frequency  domain,  this  multiplication  becomes  a  convolution  between 
the  desired  spectrum  and  that  of  the  window.  As  a  result,  the  frequency 
spectrum  is  distorted,  and  the  spectral  components  "leak"  away  from  their 
time  frequencies  and  are  distributed  over  the  total  spectrum  [32].  A 
rectangular  window  function  is  not  accurate  m  describing  the  signal,  and 
therefore  produces  signal  discontinuities  at  the  boundaries.  The  FFT  will 
add  all  harmonics  to  simulate  the  fast  rising  edge  of  the  window,  and  this 
is  inaccurate  in  representing  the  signal.  Frequency  distortion  occurs 
when  part  of  the  true  frequency  is  distributed  to  the  side  lobes.  A 
EBctanqular  window  results  m  a  sharper  cutoff,  more  narrow  mam  lobe,  but 
the  1st  side  lobe  is  only  -13  dB  down  with  the  remaining  side  lobes 
decreasing  at  -6  dB/octave.  The  Hanning  window,  which  is  often  called  the 
cosine  window,  reduces  the  side  lobe  level  to  -32  dB  with  a  -18  dB/octave 
roll-off  in  the  side  lobes.  A  suitable  window  function  needs  to  be  chosen 
to  minimize  the  "spectral  leakage".  The  window  chosen  was  a  10%  cosine 
tapered  window.  This  window  forces  the  first  data  point  to  zero,  and  the 


rising  edge  of  the  window  is  a  cosine  function.  A  correction  factor  of  G,. 
■  G./0.875  is  applied  to  the  spectrum  resulting  form  the  modified  cosine 
window  as  suggested  by  Bendat.  Discontinuities  in  general  lead  to  the 
"Gibbs  phenomenon"  which  infers  that  even  with  an  infinite  Fourier  series 
there  will  be  a  9%  error  at  the  discontinuities. 

The  power  spectrum  is  computed  from  the  square  of  the  resulting 
Fourier  coefficients. 

Gi  *  C?  ■  (a^  )  (a^-jb^ )  '17) 

G:  ■  C*  -  a*  *  b?  18  ■ 

Calculation  of  Parameters 

Three  indices  of  measure  were  calculated  from  the  power  spectrum. 
These  parameters  were  the  mean  frequency  of  the  power  spectrum  ( MPF ; ,  the 
frequency  of  the  maximum  power  (FPK),  and  the  highest  frequency  at  which 
the  power  in  the  spectrum  equals  or  is  less  than  10%  of  the  maximum  power 
( FMAX) .  The  rationale  for  selecting  these  parameters  is: 

til  In  descriptive  statistics,  the  mean  iMPF^  corresponds  to  the 
central  tendency  of  the  distribution. 

>2)  The  peak  value  (FPK)  or  the  mode  of  a  distribution  corresponds  to 
the  most  frequent  occurrence  of  an  event,  in  this  case  the  power 
within  the  respiratory  sound  spectrum. 

(3)  The  highest  frequency  <"FMAX)  at  which  the  power  in  the  spectrum 
becomes  10%  of  the  peak  power  corresponds  to  a  rough  bandwidth 
from  DC  to  the  frequency  at  which  the  power  contents  remain  10  dB 
below  FMAX  m  the  spectrum.  This  parameter  may  also  be  used  as 
an  indicator  of  the  correctness  in  the  selection  of  the  Nyquist 
sampling  frequency.  In  addition,  Charbonnaau  et  al .  [3]  used  the 
FMAX  parameter  as  part  of  a  linear  function  to  discriminate 
normal  patients  from  asthmatic  patients. 

The  mean  frequency  of  the  power  spectra  was  calculated  from  the 
equation: 

MPF  «  I  C?*F^  /ZC?  19 

where  i  is  the  index  0,  1,  2, . .  M-i. 

F  is  the  frequency  at  index  i,  and  C.  is  the  Fourier  coefficient  at  index 
i.  The  coefficient  C_  is  computed  from  the  FFT : 

C  *  a‘:*b:  '  ;  2C 

where  a_^  is  the  real  component  of  the  Fourier  coefficient  at  index  i,  and 
is  the  imaginary  component  of  the  Fourier  coefficient  at  index  i. 
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The  next  paraMttr  calculated  by  tha  program  is  the  frequency  of  the 
FMAX  of  the  power  spectrum.  A  sorting  routine  is  used  to  locate  the  FMAX 
and  the  index  at  which  the  FMAX  occurred.  The  peak  frequency  then  becomes 


where  I  is  the  index  0,2,... ,N-1,  and  PR  is  the  frequency  resolution  (PR). 
The  PR  is  calculated  from  the  period  or  window  length  in  time,  that  is: 


FR  -  1/(0. 25  s )  —  4  Hz 


The  final  parameter,  the  highest  frequency  at  10%  of  the  FMAX,  is 
calculated  in  a  manner  similar  to  the  calculation  of  the  frequency  of  the 
PMAX .  Again,  a  sorting  routine  is  used  to  find  the  index  at  which  the 

power  becomes  10%  of  the  FMAX;  then  the  frequency  is  calculated  f ran  the  index. 


DISCRIMINANT  ANALYSIS 


A  basic  problem  in  clinical  diagnosis  is  that  of  assessing  and 
classifying  individuals  based  on  a  number  of  physiological  attributes. 
This  process  requires  well-defined,  mutually  exclusive  classes,  to  one 
of  which  the  individual  in  question  must  belong. 


Several  decision-making  procedures,  varying  somewhat  in  approach, 
complexity,  and  overall  effectiveness,  are  applicable  to  the 
classification  of  observed  data.  Discriminant  analysis  is  a  method  of 
assigning  an  observation  to  1  of  2  or  more  distinct  groups.  Discriminant 
analysis  is  often  viewed  as  a  technique  for  the  description  and  testing  of 
between-group  differences  [5].  Yet  others  view  discriminant  analysis  as  a 
problem  of  assigning  an  unknown  observation  to  a  group  with  low-error  rate 


The  basic  idea  of  the  discriminant  analysis  problem  is  that  we  assume 
that  the  initial  data  can  be  classified  correctly  (i.e.,  in  defining  the 
groups)  and  some  variables  exist  that  permit  the  groups  to  be  established. 
Unfortunately  there  is  little  theory  to  guide  the  selection  of  measurement 
variables . 


A  second  fundamental  requirement  is  a  method  of  assessing  or 
assigning  group  membership.  One  approach  is  based  on  the  concept  of 
generalized  distance  developed  by  Mahalanobis  [23]  m  lieu  of  the  concept 
of  ordinary  geometric  Euclidean)  distance.  The  generalized  distance  is  a 
measure  of  group  or  profile  dissimilarity  [33],  i.e.,  the  larger  the 
generalized  distance  '23  the  greater  the  dissimilarity  between  profiles. 


Discriminant  programs  available  at  Texas  ALM  University  are 
statistical  packages  within  the  Statistical  Analysis  System  SAS  [31]. 
The  basic  SAS  discriminant  procedures  used  m  this  study  were  stepwise 
discriminant  STEPDISC’,  canonical  discriminant  'CANDISC-,  and 
discriminant  analysis  'DISCRIM),  respectively.  The  STEPDISC  and  CANDISC 
programs  were  used  first  to  evaluate  subsets  of  variables.  The  CANDISC,  a 
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dimension  reduction  technique,  is  related  to  principal  components  and 
canonical  correlation.  The  CANDISC  procedure  finds  linear  combination  of 
the  variables  that  best  summarizes  the  differences  among  the  classes.  In 
addition,  this  procedure  computes  scores  for  each  observation  on  the 
linear  combination.  The  STEPDISC  trys  to  find  a  subset  of  variables  that 
best  reveals  differences  among  the  classes.  After  a  subset  of  variables 
is  selected  with  the  STEPDISC  or  CANDISC  procedure,  the  discriminant 
procedure,  DISCRIM,  may  be  used  to  classify  the  observations  into  two  or 
more  known  classes  [31]. 

Stepwise  Discriminant  Analysis  Program 

The  STEPDISC  program  performs  the  stepwise  discriminant  analysis 
which  selects  a  subset  of  variables  to  form  a  discriminant  model  by 
forward  selection,  backward  elimination,  and  stepwise  selection. 
Variables  are  chosen  on  the  basis  of  either  criteria: 

•  The  significance  level  of  am  F-test  from  an  analysis 
of  covauriance,  or 

•  The  squared  partial  correlation  for  predicting  the  variable 
under  consideration  from  the  class  variable. 

The  stepwise  selection  begins  with  no  variable  in  the  model.  The 
variable  that  contributes  the  most  to  the  discriminatory  power  of  the 
model  as  measured  by  Wilks'  lambda  criterion  is  entered  first.  After  all 
variables  of  the  model  have  been  tested  by  the  criterion  for  entry  into 
the  model,  the  stepwise  selection  stops.  Stepwise  discriminant  models  are 
not  necessarily  the  best  possible  models  since  Wilks'  lambda  may  not  be 
the  best  measure  of  discriminatory  power.  To  guard  against  inclusion  of 
variables  that  do  not  contribute  to  the  discriminatory  power  of  the  model, 
it  is  customary  to  specify  or  select  a  minimum  significance  level. 

Canonical  Discriminant  Analysis  Program 

The  CANDISC  analysis  is  a  dimension-reduction  technique  related  to 
principal  component  analysis  and  canonical  correlation.  The  CANDISC 

program  computes  and  tests  Mahalanobis  distances  (23),  performs  univariate 
and  multivariate  analysis  of  variance,  and  derives  a  linear  combination  of 
variables,  called  the  first  canonical  correlation,  that  has  the  highest 
possible  multiple  correlation  within  groups.  The  coefficients  of  the 
linear  combination  are  the  canonical  coefficients.  The  variable  defined 
by  the  linear  combination  is  the  first  canonical  variable.  The  second 
canonical  correlation  is  obtained  by  finding  the  linear  combination 
uncorreiated  with  the  first  canonical  variable.  The  number  of  canonical 
variables  equals  the  number  of  variables  or  the  number  of  groups  minus 
one,  whichever  is  smaller.  The  output  of  the  program  includes  the 
univariate  statistics,  covariance  matrices,  the  Mahalanobis  distances  23 
between  classes,  multivariate  statistics,  the  canonical  coefficients,  and 
p^ots  of  paired  canonical  variables.  These  plots  are  an  aid  to  visua. 
interpretation  of  group  differences  [31]. 
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Discriminant  Analysis  Program 


The  DISCRIM  program  develops  a  discriminant  model  (classification 
criterion)  by  means  of  the  generalized  squared  distance  under  the 
assumption  that  each  class  has  a  multivariate  normal  distribution.  The 
generalized  squared  distance  of  observation  vector  to  group  or  class  j 
is  obtained  from  the  equation: 

Dj(Xt)  «  (Xt-X3)'  OOVj^Xi-Xj)  ♦  LogJCOVjl  (23) 

if  the  within-group  covariance  matrices,  COV.  ,  are  used  X,  is  the  means 
of  the  variables  in  the  group  j.  The  criterion  is  based 'on  either  the 
individual  within-group  covariance  matrices  or  the  pooled  covariance 
matrix.  The  DISCRIM  program  then  computes  linear  or  quadratic 
discriminant  functions  and  places  each  observation  (multivariate)  into  the 
class  from  which  the  distance  is  the  smallest.  The  output  of  the  program 
includes  the  within  covariance  matrices,  within  correlation  coefficients, 
pooled  covariance  matrix,  test  of  homogeneity  of  within  covariance 
matrices,  generalized  squared  distances,  and  a  classification  summary  or 
confusion  matrix.  The  tests  of  homogeneity  determine  whether  the 
criterion  is  based  on  the  within  group  covariance  matrices  of  the  pooled 
covariance  matrix  [31]. 


RESULTS  AND  DISCUSSION 

This  section  presents  the  results  of  the  discriminant  analysis 
programs  STEPDISC,  CANDISC,  and  DISCRIM.  The  STEPDISC  and  CANDISC  analysis 
were  used  to  gain  an  insight  as  to  which  combination  of  variables  would  be 
most  useful  in  revealing  the  differences  among  the  classes.  Then  the 
linear  discriminant  analysis  was  used  to  classify:  (1)  a  set  of  data,  the 
"training  set",  and  (2)  a  separate  set  of  data  not  used  in  the  training 
set,  the  "classification  set”  or  the  "test  set". 

STEPDISC  Results 

The  stepwise  linear  discriminant  analysis  was  performed  on  the 
expiratory  portion  of  respiratory  sound  data  for:(l)  4  classes  with  4 
variables,  (2)  3  classes  with  4  variables  for  each  sound  spectrum,  and  (3) 
3  classes  with  mean  values  of  the  4  variables  from  the  average  of  10  or 
more  spectra  for  a  subject.  The  4  classes  consisted  of  2  abnormal  groups 
and  2  normal  groups.  This  analysis  followed  one  of  the  possible  cluster 
analysis  groupings  found  by  Lessard  et  al.  [22].  The  3  class  cases 
consisted  of  2  abnormal  groups  and  1  normal  group.  Two  class  cases 
consisted  of  abnormal  and  normal  groups.  When  4  variables  were  used,  flow 
rate  was  included.  In  all  cases  the  3  respiratory  sound  spectral 
variables  ' MPF,  FPK,  and  FMAX  were  used. 

In  the  STEPDISC  analysis  of  4  classes  with  4  variables,  the  first 
variable  entered  was  the  mean  frequency  of  the  respiratory  spectrum  (MPF) 
because  of  its  high-squared  correlation  (R2)  value  (Fig.  2).  In  the 
second  step,  the  peak  frequency  (FPK)  was  entered  (Fig.  2),  and  m  the 
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STEPWISE 

SELECTION: 

STEP  1 

STATISTICS 

FOR  ENTRY. 

OF  •  3.258 

VARIABLE 

R**2 

F 

"PROB  >  F 

TOLERANCE 

MPF 

FPK 

FMAX 

FLOW 

0.6963 

0. 1788 
0.5672 
0.0647 

197 . 182 
18.722 
112.719 

5 . 947 

0.0001 

0.0001 

0.0001 

0.0007 

1  OOOO 

1  OOOO 

1  OOOO 

1  OOOO 

VARI ABLE  MPF  WILL  BE  ENTERED 

THE  FOLLOWING  VAR I  ABLE ( S  )  HAVE  BEEN  ENTERED- 
MPF 


WILKS'  LAMBDA 
PILLAIS  TRACE 


multivariate  statistics 

0.30369130  F(3,258)  »  197.182  PROB  > 

0.696309  F ( 3 , 258 )  ■  197  .182  PROB  > 

AVERAGE  SQUARED  CANONICAL  CORRELATION  »  0  23210290 


STEPWISE  SELECTION:  STEP 
STATISTICS  FOR  REMOVAL.  OF  • 
VARIABLE  R-*2  F 

MPF  0.6963  197.182 


2 

3.258 
PROB  >  F 

0.0001 


NO  VARIABLES  CAN  BE  REMOVED 


STATISTICS 

FOR  ENTRY. 

OF  •  3.257 

variable 

PARTIAL 

R*  *2 

F 

PROB  >  F 

TOLERANCE 

fpk 

fmax 

flow 

0. 1450 
0.0644 
0.0312 

14  527 

5  897 
2.760 

0  OOOl 

0  0008 
0.0420 

0  7277 

0 . 2806 

0  9652 

VARIABLE 

FPK  WILL  BE 

ENTERED 

THE  FOLLOWING  VARI ABLE ( S )  HAVE  BEEN  ENTERED: 

MPF  FPK 


MULTIVARIATE  STATISTICS 


WILKS  LAMBOA  *  0  25965893  F < 6 . 5  1 4 >  * 

PILLAIS  TRACE  *  0  806752  F16.516)  * 


82  450 
58  144 


PROB 

PROB 


AVERAGE  SQUARED  CANONICAL  CORRELATION  *  0  26391730 


F 

c 


Figure  2.  STEPDISC  result  of  4  classes  with  4  variables;  steps  1  and  2 
for  expiratory  data. 
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third  step,  the  flow  rate  was  entered  in  lieu  of  the  FMAX  (Fig.  3).  An 
examination  of  the  stepwise  selection  summary  indicates  that  the  squared 
correlation  value  (R2)  of  the  flow  variable  is  low  and  the  significance 
level  of  the  F  test  from  an  analysis  of  covariance  is  not  low  (0.0313), 
indicating  that  flow  may  not  contribute  a  great  amount  to  the 
discriminating  power  (Fig.  4). 

When  the  same  expiratory  sound  data  is  analyzed  by  the  STEPDISC 
program  for  the  3  classes  with  the  4  variables,  MPF  is  selected  in  the 
first  step  (Fig.  5)  and  FPK  is  selected  in  the  second  step  (Fig.  5).  Flow 
rate  and  FMAX  were  not  selected  because  the  significance  levels  of  the  F 
test  were  high  (Fig.  6),  if  a  third  variable  were  used  the  FMAX  would  be 
selected.  The  stepwise  selection  summary  is  given  in  Figure  6. 

In  the  last  case,  the  means  of  the  spectral  variables  and  the  mean 
flow  for  each  subject  were  used  as  input  data  into  the  STEPDISC  program 
instead  of  the  variables  from  each  quarter -second  spectrum.  Results  for 
the  3  classes,  4  variable  analysis  with  a  mean  data  set  (Appendix  E)  are 
given  in  Figures  7  through  9.  The  variables  selected  in  order  of 
discriminating  power  are  the  MPF,  FPK,  and  FMAX,  respectively.  Flow  was 
not  selected  as  a  discriminant  variable  as  indicated  in  the  selection 
summary  (Fig.  9). 

In  the  next  series  of  runs,  the  classes  were  reduced  to  2  groups 
(normal  and  abnormal)  with  4  variables  and  3  variables  (flow  rate 
excluded) .  In  the  STEPDISC  run  with  2  classes  with  4  variables  during 
expiration,  all  4  variables  were  entered  as  shown  in  Figures  10  through 
12.  For  this  case,  FPK  has  the  least  discriminant  power  (Fig.  12).  In 
the  case  of  2  classes  with  4  variables  during  inspiration,  the  MPF  was 
entered  in  the  first  step  (Fig.  13)  followed  by  the  FMAX  (Fig.  13).  The 
flow  rate  was  included  as  a  variable,  but  FPK  was  not  (Fig.  14). 
Selection  summary  is  given  in  Figure  15. 

In  the  case  of  2  classes,  with  3  variables  (flow  rate  excluded) 
during  expiration,  the  variables  FPK,  FMAX,  and  MPF  were  selected 
respectively  even  though  the  MPF  had  a  high  squared  correlation  (R2;  value 
and  a  F  test  significance  level  of  0.0001  as  is  shown  in  the  stepwise 
selection  summary  (Fig.  15).  For  inspiratory  data,  the  variables  MPF  and 
FMAX  were  selected;  and  the  FPK  was  not  entered  (Fig.  15). 

Table  1  is  a  summary  of  respiratory  sound  variable  discriminating 
power.  We  noted  that  the  MPF  is  the  variable  with  the  most  discriminating 
power  for  all  expiration  studies  except  the  2  classes,  4  variables  case. 
For  the  inspiration  studies,  the  FMAX  bandwidth  was  the  most 
discriminating  variable.  The  second  most  discriminating  variable  during 
expiration  is  the  FPK  or  mode  of  the  spectrum.  Flow  does  not  appear  to  be 
a  strong  discriminating  variable.  This  result  is  in  agreement  with 
results  presented  by  Wong  [34]  in  which  he  suggests  that  the  frequency 
spectra  of  respiratory  sounds  are  independent  of  respiratory  flow  rate  if 
the  flow  rate  is  about  or  greater  than  normal  respiratory  flow  rate  of 
about  0.75  to  1.0  L/s. 


STEPWISE  SELECTION:  STEP  3 
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Figure  3 


STATISTICS  FOR  REMOVAL.  DF  =  3.257 


VARIABLE 

MPF 

FPK 


PARTIAL 

R*«2 

0.6838 
0.  1450 


F 

185.271 

14.527 


PROS  >  F 

0.0001 

0.0001 


NO  VARIABLES  CAN  BE  REMOVED 


STATISTICS  FOR  ENTRY.  OF  »  3.256 
PARTIAL 

VARIABLE  R*  *2  F  PROB  >  F  TOLERANCE 

FMAX  0.0150  1.297  0.2752  0.1725 

FLOW  0.0338  2.987  0.0313  0.7153 

VARIA8LE  FLOW  WILL  BE  ENTERED 

THE  FOLLOWING  VARIABLE ( S )  HAVE  BEEN  ENTERED: 

MPF  FPK  FLOW 


MULTIVARIATE  STATISTICS 

LAMBDA  -  0.25087637  F<9.623.187)  =  52.974  PROB  >  F 

'S  TRACE  *  0.822477  F(9.774)  =  32.483  PROB  >  F 

AVERAGE  SQUAREO  CANONICAL  CORRELATION  *  0.27415892 
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STEPWISE  SELECTION:  STEP 
STATISTICS  FOR  REMOVAL.  DF  » 


VARIABLE 

MPF 

FPK 

FLOW 


PARTIAL 

R**2 

0  6792 
0. 1473 
0  0338 


F 

180.697 
14 , 740 
2.987 


3.256 


PROB  >  F 

0.0001 
0 . OOO 1 
0  0313 


NO  VARIABLES  CAN  BE  REMOVED 


STATISTICS  FOR  ENTRY.  DF  *  3.255 


VARIABLE 

PARTIAL 

R*  *2 

F 

PROB  >  F 

TOLERANCE 

FMAX 

0  0148 

1  275 

0  2829 

0  172  1 

NO  VARIABLES  CAN  BE  ENTERED 
NO  FURTHER  STEPS  ARE  POSSIBLE 

STEPDISC  result  of  4  classes  with  4  variables;  steps  3  and  4 
for  expiratory  data. 
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STEPWISE  SELECTION:  STEP  1 


STATISTICS  FOR  ENTRY.  DF  *  2.259 


VARIABLE 

R«*2 

F 

PROB  >  F 

TOLERANCE 

MPF 

0.5544 

161 . 147 

0.0001 

1 .0000 

FPK 

0.0876 

12.438 

0.0001 

1 .0000 

FMAX 

0.4775 

1  18 . 329 

0.0001 

1 .OOOO 

FLOW 

0.0272 

3.622 

0.0281 

1 .0000 

VARIABLE  MPF  WILL  BE  ENTERED 


THE  FOLLOWING  VARIABLE(S)  HAVE  BEEN  ENTERED: 
MPF 


MULTIVARIATE  STATISTICS 

WILKS'  LAMBDA  *  0.44555733  F(2.259)  *  161.147  PROS  >  F  =0 

PILLAI'S  TRACE  *  0.554442  F(2.259)  *  161.147  PROB  >  F  =0 

AVERAGE  SQUARED  CANONICAL  CORRELATION  *  0.27722108 


STEPWISE  SELECTION:  STEP  2 
STATISTICS  FOR  REMOVAL.  DF  =  2.259 
VARIABLE  R *  *2  F  PROB  >  F 

MPF  0.5544  161.147  0.0001 

NO  VARIABLES  CAN  BE  REMOVED 


STATISTICS 

FOR  ENTRY, 

DF  =  2.258 

PARTIAL 

VARIABLE 

R*  *2 

F 

PROB  >  F 

TOLERANCE 

FPK 

0. 1450 

21.874 

0 . 000 1 

0.7277 

FMAX 

0.0636 

8 . 769 

0 . 0002 

0  2806 

FLOW 

0.0016 

0.203 

0.8164 

0.9652 

VARIABLE 

FPK  WILL  BE 

ENTERED 

THE  FOLLOWING  VAR  I  ABLE ( S  )  HAVE  BEEN  ENTERED 
MPF  FPK 


MULTIVARIATE  STATISTICS 
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Figure  5.  STEPDISC  result  of  3  classes  with  4  variables;  steps  1  and 
for  expiratory  data. 
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Figure  6.  STEPDISC  result  of  3  classes  with  4  variables;  step  3  and  the 
selection  summary  for  expiratory  data. 
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Figure  7.  STEPDISC  result  of  3  classes  with  4  variables;  steps  1  and  2 
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Figure  8.  STEPDISC  result  of  3  classes  with  4  variables;  step  3  for  mean 
expiratory  data. 
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Figure  9.  STEPDISC  result  of  3  classes  with  4  variables;  step  4  and  the 
selection  summary  for  mean  expiratory  data. 
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Pigur«  10.  STEPDISC  result  of  2  classes  with  4  variables;  steps  1  and  2 
for  each  expiration. 
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Figure  11.  STEPDISC  result  of  2  classes  with  4  variables;  steps  3  and  4 
for  each  expiration. 
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Figure  12.  STEPDISC  result  of  2  classes  with  4  variables; 

step  5  and  the  selection  summary  for  each  expiration. 
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Figure  14.  STEPDISC  result  of  2  classes  with  4  variables;  steps  3  and  4 
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Table  1.  SUMMARY  OF  RANK  ORDER  OF  VARIABLE  DISCRIMINATING  POWER 


Resp. 

mode 

Number 

classes 

No. 

var. 

Data 
set  1 

Order 

2  3 

Ref 

4  Fig.  No. 

Exp 

4 

4 

Each 

MPF 

FPK 

FLOW 

2-4 

Exp 

3 

4 

Each 

MPF 

FPK 

5-6 

Exp 

3 

4 

Mean 

MPF 

FPK 

FMAX 

7-9 

Exp 

2 

4 

Each 

FMAX 

FLOW 

MPF 

FPK  10-12 

Ins 

2 

4 

Each 

FMAX 

MPF 

FLOW 

13-15 

Exp 

2 

3 

Each 

MPF 

FPK 

FMAX 

15 

Ins 

2 

3 

Each 

FMAX 

MPF 

15 

Exp  *  expiratory 
Ins  *  inspiratory 


CANDISC  Results 


The  canonical  discriminant  program  was  initially  run  on  the 
expiratory  sound  data  with  4  classes  and  4  variables.  The  means  of  the 
variables  for  a  subject  were  used  as  input  data  to  the  CANDISC  program. 
Table  2  shows  the  resulting  Mahalanobis  distances  (23)  between  classes 
0-3.  There  is  less  separation  between  abnormal  class  C  and  normal  class 
1,  than  between  the  2  normal  classes  1  and  3.  Abnormal  class  2  has  the 
greatest  separation  from  all  other  classes.  This  separation  may  also  be 
readily  seen  in  the  plot  of  canonical  variables  CAN1  vs.  CAN3  (Fig.  16). 


Table  2.  MAHALANOBIS  DISTANCE  BETWEEN  CLASSES 


Class 

0 

1 

2 

3 

0 

• 

3.58 

16.02 

2.92 

1 

3.58 

• 

13.24 

6.10 

2 

16.02 

13.24 

• 

18.88 

3 

2.92 

6.10 

18.88 

Note: 

Normal:  classes  1  &  3 
Abnormal:  classes  0  &>  2 


Table  3  is  a  summary  of  Mahalanobis  distances  (23)  between  classes 

for  the  2  class  cases  (normal  vs.  abnormal).  The  table  also  shows  that 

the  distance  between  groups  increases  when  the  variables  were  averaged  and 
were  the  greatest  when  the  average  flow  was  added  as  a  fourth  variable. 
This  result  appears  to  disagree  with  results  from  the  stepwise  linear 
analysis  which  did  not  include  flow  as  a  strong  discriminant  variable 
■except  one  case  during  inspiration  -  Fig.  15). 

DISCRIM  Results 

Before  running  the  discriminant  analysis  program  (DISCRIM),  the  data 
was  divided  into  2  subsets,  a  "training”  (contains  7  abnormals  and  5 

normals;  set  and  an  independent  "test”  set  for  classification  with  the 
discriminants  obtained  from  the  training  set  (Appendix  A,  direct  set).  In 
addition,  the  data  was  analyzed  with  the  sets  reversed,  i.e.,  the  training 
set  became  the  independent  test  set  and  the  test  set  was  used  as  the 


Figure  16.  Plot  of  canonical  variables  CAN1  vs.  CAN 3  from  CANDISC 
program  results  on  mean  expiratory  data  with  4  classes 
and  4  variables. 


Table  3.  SUMMARY  OF  MAHALANOBIS  DISTANCES  BETWEEN  CLASSES  FOR  2  CLASSES 
(NORMAL  AND  ABNORMAL) 

Expiration  Inspiration 


Each 

Mean 

Each 

Mean 

4  variables 

- 

2.0899 

- 

1.3738 

3  variables 

0.6481 

1.4475 

0.3963 

0.5310 

training  set.  Runs  with  this  data  set  are  referred  to  as  "reversed".  As 
before,  the  data  is  in  individual  (1/4-s  epoch)  spectral  variables  or  in 
averaged  variables  form.  The  terminology  "each"  is  used  when  discussing 
data  derived  from  the  individual  spectrum,  and  "mean"  when  discussing  data 
derived  from  the  average  variables  from  numerous  segments  of  breaths 
measured  from  a  subject.  Classification  summaries  of  training  and 
classification  test  data  sets  were  obtained  for  "each"  expiration 
(Appendix  A),  and  inspiration  (Appendix  B)  direct  and  reversed  data  sets. 

For  2  classes  (normal  and  abnormal),  3  parameters  from  each  spectrum 
of  expiration  (Appendix  A,  Direct  data  set),  Table  4  presents  the 
classification  summaries  for  the  training  and  the  classification  sets. 
The  number  of  correctly  classified  expiratory  breaths  from  the  training 
set  are  contained  in  the  main  diagonal,  e.g.,  170  and  89  =  259  correctly 
classified  out  of  348  or  about  74.4%;  off  diagonal  values  are 
classification  errors.  For  the  training  set,  54  abnormal  expirations  were 
misclassif ied  as  normal,  and  35  normal  expirations  were  misclassif ied  as 
abnormal  for  a  total  of  89  out  of  348  or  25.6%  error. 

Classification  of  the  expiratory  data  reversed  resulted  in  the 
summary  Table  5.  Approximately  40%  of  the  training  set  is  misclassif ied; 
this  error  increases  to  about  60%  for  new  data  (the  classification  test 
set) . 

Results  of  the  discriminant  analysis  of  "each"  inspiratory  sounds 
with  direct  and  reversed  data  sets  are  presented  in  Tables  6  and  7.  in 
both  cases  the  training  sets  have  fewer  misclassif ied  data  points  than  the 
classification  test  set.  Discriminant  results  are  a  little  better  with 
the  direct  data  set  than  with  reversed  data  set. 

In  addition,  classification  summaries  were  obtained  from  "mean"  data 
sets  given  in  Appendixes  C  and  D.  Tables  8  and  9  present  the 


Table  4.  CLASSIFICATION  SUMMARY  OF  EXPIRATORY  SOUND,  DIRECT  DATA  SET: 
EACH 

Training  Set 

Number  of  observations  classified  into  class: 


Class 

0 

1 

Total 

0 

170 

54 

224 

1 

35 

89 

124 

Total 

205 

143 

348 

Misclassif ied 

25.6% 

Classification  Set 

Number  of  observations  classified  into  class: 


Class 

0 

1 

Total 

0 

97 

68 

165 

1 

76 

55 

131 

Total 

173 

123 

296 

Misclassif ied 

48.6% 

Class  0  =  Abnormal,  Class  1  =  Normal. 


Table  5.  CLASSIFICATION  SUMMARY  OF  EXPIRATORY  SOUND,  REVERSED  DATA  SET: 
EACH 


Training  Set 

Number  of  observations  classified  into  class: 


Class 

0 

1 

Total 

0 

56 

109 

165 

1 

9 

122 

131 

Total 

.  65 

Misclassif ied 

231 

39.9% 

296 

Classification  Set 

Number  of  observations  classified  into  class: 


Class 

0 

1 

Total 

0 

61 

163 

224 

45 

79 

124 

Total 

106 

242 

348 

Misclassif ied 

59.8% 

Class  0  =  Abnormal,  Class  1  =  Normal. 
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Table  6.  CLASSIFICATION  SUMMARY  OF  INSPIRATORY  SOUND,  DIRECT  DATA  SET: 
EACH 


Training  Set 


Number  of  observations  classified  into  class: 


Class 


Total 


Total 


Misclassified  24.8% 


Classification  Set 


Number  of  observations  classified  into  class: 


Class 


Total 


Total 


Misclassified  54.1% 


Class  0  =  Abnormal,  Class  1  =  Normal. 


Table  7 


CLASSIFICATION  SUMMARY  OF  INSPIRATORY  SOUND,  REVERSED  DATA 
SET:  EACH 


classification  summaries  for  expiratory  sounds  with  direct  and  reversed 
data  sets  (Appendix  C).  The  results  of  classifying  new  data  are  about  the 
same  with  either  data  set.  The  overall  classification  accuracy  is  about 
80%.  Classification  summaries  for  inspiratory  sounds  with  direct  and 
reversal  data  sets  (Appendixes  D)  are  given  in  Tables  10  and  11.  In  both 
cases,  the  accuracy  for  the  training  sets  are  better  than  classification 
accuracy  for  the  test  sets.  The  overall  classification  accuracy  is  about 
67%  with  the  direct  and  about  80%  with  the  reversed  data  set. 

To  compare  results  obtained  with  the  respiratory  sound  parameters 
to  a  standard  pulmonary  classification  scheme,  3  pulmonary  function  test 
parameters  provided  by  U.S.  Air  Force  medical  center  were  used  to  classify 
the  subjects.  The  pulmonary  function  test  parameters  used  as  input 
variables  to  the  discriminant  analysis  program  were  forced  vital  capacity 
(FVC),  forced  expiratory  volume  at  1  s  (FEV1),  and  the  ratio  of  forced 
expiratory  volume  at  1  s  to  forced  vital  capacity  expressed  in  percentage 
( FEV1P) .  This  data  should  correspond  to  classification  with  the  mean  data 
set  since  the  pulmonary  function  test  is  per  subject  rather  than  per 
breath.  The  results  of  using  the  pulmonary  function  test  parameters  with 
direct  and  reversed  data  sets,  Appendix  E,  are  given  in  Tables  12  and  13. 
Note  the  similarity  in  results  with  the  "mean”  expiratory  data  (Tables  8 
and  9)  and  the  PFT  results  (Tables  12  and  13).  This  similarity  may  be 
explained  by  the  fact  that  the  pulmonary  function  test  is  per  subject  as 
is  the  mean  of  several  breaths,  and  that  the  pulmonary  function  test  is 
based  primarily  on  measurements  during  forced  expiration.  There  was  an 
overall  20.8%  error  in  classifying  patients  by  the  3  parameters  from  the 
pulmonary  function  tests.  The  error  may  be  caused  by  the  classes,  as  seen 
by  the  standard  3  parameters,  not  being  mutually  exclusive,  the  data  may 
be  incorrectly  classified,  or  not  enough  standard  variables  provided  for 
evaluation. 

Table  14  is  a  summary  of  all  the  discriminant  classification  runs. 
The  upper  half  of  the  table  presents  percentage  correctly  classified  and 
the  lower  half  presents  the  total  percent  error  in  classification.  As 
indicated  by  this  table,  classification  with  the  mean  data  resulted  in 
improved  accuracy.  The  improved  accuracy  may  be  the  result  of  better 
estimators  due  to  smoothing  of  variability  by  averaging. 

Regression  Analysis 

Multiple  regression  analysis  was  performed  to  determine  if  the 
spectral  parameters  for  discriminating  breath  sounds  correlate  with  the 
clinical  data  obtained  by  the  pulmonary  function  test.  In  general, 
regression  analysis  fits  least-square  estimates  to  a  linear  regression 
model  of  the  form: 

y_  =  W0  +  w.Xl.  +  W..X2.  ♦  -  +  t  -24 

where  y_  is  the  dependent  variable  that  can  be  predicted  by  a  linear 
combination  of  the  weighted  independent  variables  X's  for  the  observations 
i=l,  2,...,  n.  The  principle  of  least  squares  is  used  to  produce 
estimates  for  the  weighting  factors  (W's)  that  are  the  best  linear 
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Table  8.  CLASSIFICATION  SUMMARY  OF  EXPIRATORY  SOUND,  DIRECT  DATA  SET: 
MEAN 


Training  Set 

Number  of  observations  classified  into  class: 
Class  0  1  Total 


Total  8  4 

Misclassified  8.3% 


12 


Classification  Set 

Number  of  observations  classified  into  class: 
Class  0  1  Total 

0  5  2  7 

12  3  5 

Total  7  5  12 

Misclassified  33.3% 


Class  0  =  Abnormal,  Class  1  =  Normal. 
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Table  9.  CLASSIFICATION  SUMMARY  OF  EXPIRATORY  SOUND,  REVERSED  DATA  SET: 
MEAN 


Training  Set 


Class 


Number  of  observations  classified  into  class: 


Total 


Total 


Misclassif ied  16.7% 


Classification  Set 


Class 


Number  of  observations  classified  into  class: 


Total 


Total 


Misclassif led  25% 


Hass  0  =  Abnormal,  Class  1  =  Normal. 


m 


Table  10.  CLASSIFICATION  SUMMARY  OF  INSPIRATORY  SOUND,  DIRECT  DATA  SET: 
MEAN 

Training  Set 

Number  of  observations  classified  into  class: 

Class  0  1  Total 

0  6  17 

10  5  5 

Total  6  6  12 

Misclassif led  8.3% 

Classification  Set 

Number  of  observations  classified  into  class: 

Class  0  1  Total 

0  16  7 

114  5 

Total  2  10  12 

Misclassif led  58.3% 

Class  0  =  Abnormal,  Class  1  =  Normal. 


Table  11.  CLASSIFICATION  SUMMARY  OF  INSPIRATORY  SOUND,  REVERSED  DATA 
SET:  MEAN 


Training  Set 

Number  of  observations  classified  into  class: 
Class  0  1  Total 


Total  7  5  12 

Misclassif ied  0.0% 

Classification  Set 

Number  of  observations  classified  into  class: 
Class  0  1  Total 

0  7  0  7 


1  5  0 

Total  12  0 

Misclassif ied  41.7% 


5 

12 


Class  0  =  Abnormal,  Class  1  =  Normal. 
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Table  12.  CLASSIFICATION  SUMMARY  OF  PFT,  DIRECT  DATA  SET 


Class 

0 

Total 


Training  Set 

Number  of  observations  classified  into  class: 
0  1  Total 

7  0  7 

0  5  5 

7  5  12 

Misclassif ied  0.0% 


I 


Classification  Set 

Number  of  observations  classified  into  class: 
Class  0  1  Total 

0  7  0  7 

1  5  0-5 

Total  12  0  12 

Misclassif ied  41.7% 

Class  0  =  Abnormal,  Class  1  =  Normal. 
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Table  13.  CLASSIFICATION  SUMMARY  OF  PFT,  REVERSED  DATA  SET 


Training  Set 

Number  of  observations  classified  into  class 
Class  3  1  Total 

0  6  17 

10  5  5 

Total  6  6  12 

Misclassified  8.3% 

Classification  Set 

Number  of  observations  classified  into  class 
0  1  Total 

7  0  7 

4  1  5 

11  1  12 

Misclassified  33.3% 

Class  0  =  Abnormal,  Class  1  =  Normal. 


Class 

0 

1 

Total 


Table  14.  SUMMARY  OF  CLASSIFICATION  ACCURACY 


I 


Data  Set 
Type 
Each 

expiration 

Each 

inspiration 

Mean 

expiration 

Mean 

inspiration 

PFT 


Overall  Percentage  Correctly  Classified 

Direct  Reversed 


Train 

Test 

Overall 

Train 

Test 

Overall 

77.4 

51.4 

63.8 

60.1 

40.2 

49.4 

75.2 

45.9 

62.4 

64.6 

40.6 

51.2 

91.7 

66.7 

79.2 

83.3 

75.0 

79.2 

91.7 

41.7 

66.7 

100.0 

58.3 

79.2 

100.0 

58.3 

79.2 

91.7 

66.7 

79.2 

Overall  Percentage  Misclassif ied 


Each 

expiration 

25.6 

48.6 

36.2 

39.9 

59.8 

50.6 

Each 

inspiration 

24.8 

54.1 

37.6 

35.4 

59.4 

48.8 

Mean 

expiration 

8.3 

33.3 

20.8 

16.7 

25.0 

20.8 

Mean 

inspiration 

3.3 

58.3 

33.3 

0.0 

41.7 

20.8 

PFT 

3.0 

41.7 

2C.8 

8.3 

33.3 

20.8 
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unbiased  estimates  under  classical  statistical  assumptions.  The  regression 
programs  available  at  Texas  A&M  University  are  statistical  packages  within 
the  SAS  [31].  The  program  outputs  a  summary  of  the  statistics  for  each 
dependent  variable  to  each  independent  variable  or  combination  of 
independent  variables  and  plots  of  each  dependent  variable  vs.  each 
independent  variable.  Typical  plots  of  a  first  order  model  for  FVC  as  the 
dependent  variable  and  the  mean  frequency  of  the  spectral  parameter  (MPF) 
are  given  for  expiration  and  inspiration  in  Figures  17  and  13  respectively. 
In  both  cases,  the  predicted  linear  fit  are  almost  horizontal  lines 
indicating  that  there  is  almost  no  relationship  between  the  dependent  and 
independent  variables.  This  linear  fit  is  backed  up  by  the  low  R-square 
values  which  are  0.0025  and  0.0140.  Table  15  summarizes  the  R-square 
values  for  the  dependent  variables  and  the  independent  variables  used  in 
the  regression  model.  Expiratory  data  set  used  for  the  regression  analysis 
is  given  in  Appendix  F  and  the  inspiratory  data  set  in  Appendix  G.  In  all 
cases  the  low  values  of  R-square  results  indicates  that  this  is  minimal 
correlation  or  relationship  between  the  3  respiratory  sound  parameters  and 
the  3  pulmonary  function  test  parameters.  Thus,  pulmonary  function  test 
parameters  (i.e.,  FVC  or  FEV1  or  FEV1P)  cannot  be  predicted  by  any 
combination  of  respiratory  sound  parameters  (MPF,  FPK,  FMAX) . 


CONCLUSIONS 

The  objectives  of  this  study  were  to  determine  whether  respiratory 
sound  data  of  normal  volunteers  and  pulmonary  insufficient  subjects  can  be 
classified  by  DISCRIM  analysis,  and  to  determine  if  the  spectral  parameters 
for  discriminating  respiratory  sounds  correlate  with  the  clinical 
parameters  obtained  from  the  pulmonary  function  test.  Three  DISCRIM 

analysis  programs  and  a  regression  analysis  package  were  used  in  the  study. 

The  conclusions  from  the  DISCRIM  analysis  are: 

•  Variables  with  the  most  discriminating  power  are  MPF,  FPK,  and  FMAX. 
Flow  was  not  a  good  discriminator . 

•  Results  of  the  CANDISC  program  indicated  that  the  Mahalanobis 
distances  (23)  between  the  2  classes  (normal  and  abnormal)  increase 
when  variables  were  averaged  (i.e.,  mean  data  set). 

•  Classification  with  the  mean  data  is  better  than  classification  with 
data  based  on  each  breath.  This  data  may  be  expected  since  the  mean 
data  set  consist  of  estimators  that  have  been  smoothed  by  averaging 
thus  reducing  the  variability.  The  only  caution  with  accepting 
results  from  the  mean  data  is  the  low  number  of  samples  in  both 
normal  and  abnormal  classes  (12  each). 

•  Classification  with  the  pulmonary  function  test  parameters  indicates 
that  the  classes  may  not  have  been  classified  100%  correctly  by  the 
rater  or  raters.  This  error  in  classification  affects  the  outcome 
and  accuracy  of  the  discriminant  program. 

•  Overall  PFT  direct  rendered  19/24  =  79.2%  correct  and  5/24  =  20.8% 

error  which  is  the  same  as  mean  expiration  data  direct  (19/24  = 
79.2%  overall  correct).  In  addition  the  reverse  PFT  rendered  19/24 
=  79.2%  correct  and  5/24  =  20.8%  error  whereas  mean  expiration 
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during  inspiration. 


Table  15.  SUMMARY  OF  R-SQUARE  VALUES  OF  REGRESSION  MODELS 


Mean 

data: 

Expiration, 

N  = 

24 

No.  of 
ind.  var. 
in  model 

Independent 

variables 

FVC 

FEV1 

FEV1P 

1 

MPF 

0.0025 

0.0107 

0.0091 

1 

FPK 

0.0166 

0.0449 

0.0073 

1 

FMAX 

0.0070 

0.0132 

0.1063 

2 

MPF, FPK 

0.0168 

0.0450 

0.0310 

2 

FPK, FMAX 

0.0359 

0.0863 

0.1527 

2 

MPF, FMAX 

0.0621 

0.1638 

0.2303 

3 

MPF, FPK, FMAX 

0.0712 

0.1853 

0.2405 

Mean 

data: 

Inspiration 

r  N 

=  24 

1 

MPF 

0.0140 

0.0082 

0.0214 

1 

FPK 

0.0014 

0.0034 

0.0025 

1 

FMAX 

0.0108 

0.0008 

0.0712 

2 

MPF , FPK 

0.0201 

0.0085 

0.0298 

2 

FPK, FMAX 

0.0120 

0.0035 

0.0930 

2 

MPF, FMAX 

0.0147 

0.0410 

0.1859 

3 

MPF, FPK, FMAX 

0.0250 

0.0531 

0.1968 

reversed  set  also  rendered  19/24  =  79.2%  correct  and  5/24  =  20.8% 
error . 


Thus,  results  of  classification  with  expiratory  data 
approximates  results  of  classification  with  the  PFT  variables. 
This  stands  to  reason  since  the  pulmonary  function  test  is  based  on 
forced  expiration  measurements. 

Conclusions  from  the  regression  studies  are: 

•  The  respiratory  sound  variable,  peak  frequency  or  mode  of  the 
spectrum  (FPK)  contributes  the  least  to  multiple  regression. 

•  There  is  minimal  or  no  correlation  between  the  respiratory  sound 
spectral  parameters  (MPF,  FPK,  FMAX)  and  the  3  pulmonary  function 
test  parameters  (FVC,  FEV1,  FEV1P) . 


RECOMMENDATIONS 

The  results  of  the  DISCRIM  analysis  are  encouraging  from  the 
standpoint  that  classif ication  with  mean  expiratory  data  closely 
approximates  results  from  classification  with  the  3  pulmonary  function 
test  variables.  Much  has  been  learned  from  these  studies —  for  example, 
(l)better  controls  are  needed  during  collection  of  data  to  reduce  the 
variability;  (2)  since  cardiac  valve  sounds  may  be  heard  with  the 
respiration,  the  contribution  of  the  valve  sounds  must  be  examined  and 
eliminated,  or  reduced  to  decrease  variability;  and  (3)  other  parameters 
from  the  respiratory  spectra  must  be  added  and  evaluated. 

Each  subject  should  be  screened  with  a  full  pulmonary  function  test. 
These  results  should  be  classified  by  at  least  3  experts  in  pulmonary 
function.  Disagreements  should  not  be  used  in  the  initial  evaluation. 
Records  and  data  in  which  full  agreement  is  reached  initially  should  only 
be  used.  The  raters  should  be  interviewed  to  establish  which  parameters 
and  in  what  order  were  the  parameters  used  in  classification.  Twelve  or 
more  subjects  are  necessary  per  class.  To  start ,  the  classes  should  be 
mutually  exclusive.  Finally,  the  expansion  of  parameters  should  also 
include  expansion  of  respiratory  sound  parameters,  i.e.,  the  spectral 
energy  in  broad  frequency  bands  <100-150,  150-200,  etc',  the  second  and 

third  movements  of  the  distribution  about  zero.  In  short,  we  recommended 
that  the  study  be  repeated  under  contractor  control  from  start  to  finish 
as  soon  as  possible. 
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APPENDIX  A 


B  DATA  EXP; 

■  TITLE 

'POWER  SPECTRA 

DATA  ON  EXPIRATORY 

SOUND '  ; 

■  COMMENT  STATE : 

NORMAL 

=  1  ABNORMAL*  0; 

ft  COMMENT  DIRECT 

CALIBRATION  DATA; 

J  INPUT 

ID  S 

FVC 

FEVI 

FEV1P  STATE 

FLOW  MPF 

FPK  FMAX 

t 

1  CARDS; 

P 

73 

32 

44 

0 

1.14 

247.6 

96.0 

280.0 

P 

73 

32 

44 

0 

1.57 

275.6 

104.0 

248.0 

* 

P 

73 

32 

44 

0 

1.61 

192.4 

80.0 

272.0 

P 

73 

32 

44 

0 

1.51 

229.5 

80.0 

260.0 

1  P 

73 

32 

44 

0 

1.31 

300.5 

100.0 

324.0 

* 

P 

73 

32 

44 

0 

1.30 

180.6 

108.0 

260.0 

P 

73 

32 

44 

0 

1.13 

154.1 

80.0 

172.0 

P 

73 

32 

44 

0 

1.11 

352.9 

64.0 

260.0 

P 

73 

32 

44 

0 

1.05 

236.4 

60.0 

172.0 

P 

73 

32 

44 

0 

1.0 

365.4 

108.0 

264.0 

P 

73 

32 

44 

0 

1.17 

436.5 

172.0 

260.0 

P 

73 

32 

44 

0 

1.79 

341.7 

128.0 

1016.0 

P 

73 

32 

44 

0 

1.97 

367.9 

172.0 

620.0 

1  p 

73 

32 

44 

0 

1.91 

351.6 

148.0 

592.0 

1  P 

73 

32 

44 

0 

1.74 

365.4 

96.0 

1316.0 

P 

73 

32 

44 

0 

1.54 

286.6 

108.0 

560.0 

P 

73 

32 

44 

0 

1.31 

312.0 

104.0 

580.0 

P 

73 

32 

44 

0 

1.16 

450.3 

172.0 

1376.0 

!  p 

73 

32 

44 

0 

1.03 

367.6 

100.0 

420.0 

p 

73 

32 

44 

0 

0.95 

336.5 

172.0 

600.0 

!  p 

73 

32 

44 

0 

0.68 

484.6 

172.0 

1164.0 

73 

32 

44 

0 

1.21 

527.1 

172.0 

1376.0 

t  p 

73 

32 

44 

0 

1.54 

514.9 

160.0 

1416.0 

H  P 

73 

32 

44 

0 

1.60 

625.3 

156.0 

1376.0 

■  P 

73 

32 

44 

0 

1.64 

475.8 

208.0 

1360.0 

1 

73 

32 

44 

0 

1.57 

383.5 

148.0 

612.0 

gl  P 

73 

32 

44 

0 

1.40 

397.5 

172.0 

1256.0 

D  p 

73 

32 

44 

0 

1.32 

459.0 

160.0 

1332.0 

BJ  p 

73 

32 

44 

0 

1.12 

490.7 

172.0 

1288.0 

B  p 

73 

32 

44 

0 

1.08 

439.7 

172.0 

1336.0 

1  p 

73 

32 

44 

0 

0.65 

581.9 

172.0 

1276.0 

1  p 

73 

32 

44 

0 

1.44 

554.5 

180.0 

1440.0 

9  p 

73 

32 

44 

0 

1.88 

604.6 

120.0 

1388.0 

a  ? 

73 

32 

44 

0 

1.72 

478.9 

196.0 

1372.0 

5  p 

73 

32 

44 

0 

1.57 

364.1 

108.0 

532.0 

3j  p 

73 

32 

44 

0 

1.42 

271.9 

100.0 

636.0 

y  p 

73 

32 

44 

0 

1.34 

473.0 

172.0 

1452.0 

a  ? 

73 

32 

44 

0 

1.21 

436.6 

172.0 

1360.0 

* 

E  ? 

73 

32 

44 

0 

1.10 

379.1 

80.0 

1056.0 

R  p 

73 

32 

44 

0 

1.05 

565.2 

292.0 

1372.0 

E 

72 

53 

124 

0 

1.12 

14C.2 

92.0 

120.0 

E 

72 

53 

124 

0 

1.37 

90. 4 

92.0 

112.0 

| 

72 

58 

124 

0 

1.23 

241.7 

100.0 

140.0 

1 

72 

58 

124 

0 

1.19 

186.1 

80.0 

124.0 

ft 

72 

58 

124 

0 

1.29 

152.6 

112.0 

180.0 

s 

72 

58 

124 

0 

1.08 

209.7 

108.0 

148.0 

56 

C 

is 

2$ 

‘•'rj 

+ 

72 

58 

124 

0 

0.30 

190.6 

72.0 

140.0 

+ 

72 

58 

124 

0 

0.78 

129.3 

72.0 

180.0 

'  *J 

+ 

72 

58 

124 

0 

0.62 

116.4 

64.0 

128.0 

+ 

72 

58 

124 

0 

0.99 

121.2 

108.0 

188.0 

72 

58 

124 

0 

1.46 

133.1 

88.0 

128.0 

+ 

72 

58 

124 

0 

1.52 

135.6 

104.0 

188.0 

4* 

72 

58 

124 

0 

1.35 

138.4 

120.0 

164.0 

+ 

72 

58 

124 

0 

1.38 

114.4 

104.0 

152.0 

•*,;»» 

4- 

72 

58 

124 

0 

1.23 

133.0 

88.0 

192.0 

$S 

4- 

72 

58 

124 

0 

0.84 

124.6 

112.0 

156.0 

4* 

72 

58 

124 

0 

0.92 

127.7 

88.0 

148.0 

4- 

72 

58 

124 

0 

1.55 

110.4 

96.0 

136.0 

!‘f>* 

4“ 

72 

58 

124 

0 

1.42 

113.4 

96.0 

140.0 

4* 

72 

58 

124 

0 

1.65 

132.6 

96.0 

164.0 

i»M 

4- 

72 

58 

124 

0 

1.74 

112.4 

96.0 

172.0 

4 

72 

58 

124 

0 

1.31 

103.0 

104.0 

120.0 

38 

4 

72 

58 

124 

0 

1.03 

144.9 

104.0 

144.0 

$ 

4 

72 

58 

124 

0 

0.91 

123.1 

68.0 

176.0 

+ 

72 

58 

124 

0 

0.64 

106.4 

80.0 

112.0 

4 

72 

58 

124 

0 

0.71 

144.4 

88.0 

136.0 

4 

72 

58 

124 

0 

0.85 

110.2 

84.0 

132.0 

-*.V 

4 

72 

58 

124 

0 

0.90 

109.5 

84.0 

144.0 

4 

72 

58 

124 

0 

0.94 

109.5 

92.0 

144.0 

4 

72 

58 

124 

0 

1.01 

103.9 

104.0 

160.0 

•s& 

4 

72 

58 

124 

0 

1.07 

120.6 

112.0 

160.0 

4 

72 

58 

124 

0 

1.11 

118.5 

96.0 

168.0 

4 

72 

58 

124 

0 

1.04 

11/  .8 

84.0 

140.0 

1 

* 

99 

105 

105 

1 

0.69 

431.7 

112.0 

260.0 

w 

99 

105 

105 

1 

1.06 

413.2 

112.0 

124.0 

Mfi 

* 

99 

105 

105 

1 

1.22 

592.8 

260.0 

1408.0 

..*^1 

* 

99 

105 

105 

1 

X 

1.22 

431.0 

112.0 

260.0 

j 

99 

105 

105 

1 

1.14 

473.2 

108.0 

484.0 

4 

99 

105 

105 

1 

1.14 

373.5 

112.0 

172.0 

4 

* 

99 

105 

105 

1 

1.13 

S34 .5 

112.0 

1316.0 

4 

99 

105 

105 

1 

1.15 

519.9 

112.0 

260.0 

4, 

4 

99 

105 

105 

1 

1.18 

531.5 

112.0 

1036.0 

A/1 

4 

99 

105 

105 

1 

1.29 

603.6 

108.0 

1476.0 

4 

99 

105 

105 

1 

1.29 

394.7 

112.0 

124.0 

•ft 

4 

99 

105 

105 

1 

1.28 

476.2 

112.0 

776.0 

4 

99 

105 

105 

1 

x 

1.26 

478.9 

112.0 

560.0 

•V; 

W 

99 

105 

105 

X 

1.2C 

471.8 

112.0 

1280.0 

{.ft. 

/  M 

4 

99 

105 

105 

1 

1.18 

342.4 

108.0 

260.0 

4 

99 

105 

105 

1 

1.04 

536.3 

260.0 

1376.0 

4 

99 

105 

105 

X 

1.27 

459.5 

108.0 

256.0 

vj* 

4 

99 

105 

105 

1 

1.24 

562.2 

108.0 

1312.0 

4 

99 

105 

105 

1 

1.27 

535.5 

112.0 

1332.0 

•* 

4 

99 

105 

105 

X 

1.21 

543.2 

172.0 

1372.0 

4 

99 

105 

105 

1 

1.15 

560.3 

172.0 

1344.0 

j  . 

4 

99 

105 

105 

1 

1.08 

389.9 

108.0 

260.0 

*?A 

4 

99 

105 

105 

1 

1.07 

538.4 

172.0 

512.0 

4 

99 

105 

105 

1 

0.75 

575.6 

108.0 

1280.0 

K: 

4 

99 

105 

105 

X 

1.05 

523.8 

112.0 

260.0 

m 

.  v 

«  * 

& 

57 

« 

99 

105 

105 

1 

1.05 

484.6 

172.0 

1316.0 

* 

99 

105 

105 

1 

1.08 

361.0 

112.0 

172.0 

w 

99 

105 

105 

1 

1.13 

419.3 

108.0 

172.0 

n 

99 

105 

105 

1 

1.16 

463.6 

112.0 

260.0 

w 

99 

105 

105 

1 

1.07 

524.8 

112.0 

500.0 

3 

70 

74 

106 

0 

0.46 

386.5 

116.0 

1372.0 

•f 

70 

74 

106 

0 

0.64 

414.9 

112.0 

588.0 

3 

70 

74 

106 

0 

0.71 

404.2 

176.0 

652.0 

a 

70 

74 

106 

0 

0.30 

400.1 

92.0 

632.0 

a 

70 

74 

106 

0 

0.39 

378.8 

156.0 

724.0 

2 

70 

74 

106 

0 

0.95 

401.1 

132.0 

672.0 

S 

70 

74 

106 

0 

0.92 

391.3 

144.0 

1180.0 

a 

70 

74 

106 

0 

0.98 
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Expiration  Direct  Data  Set:  Each 


Test  Set 

7  Abnormal s  ( STATE =0 ) 
M 
C 
D 
G 
S 
B 
E 


5  Normals  (STATE=1) 
K 


DATA  EXP; 

TITLE  'POWER  SPECTRA  DATA 
COMMENT  STATE:  NORMAL =  I 
COMMENT  DIRECT  TEST  DATA; 


ON  EXPIRATORY  SOUND ' ; 
ABNORMAL=  0; 
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DATA  BREATH; 

TITLE  'POWER  SPECTRA  DATA  ON  INSPIRATORY  SOUND’; 
COMMENT  STATE:  NORMAL =  1  ABNORMAL®  0; 

COMMENT  DIRECT  TEST  DATA; 
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